Making DATR Work for Speech: Lexicon Compilation in SUNDIAL

نویسندگان

  • François Andry
  • Norman M. Fraser
  • Scott McGlashan
  • J. H. Simon Thornton
  • Nick J. Youd
چکیده

We present DIALEX, an inheritance-based tool that facilitates the rapid construction of linguistic knowledge bases. Simple lexical entries are added to an application-specific DATR lexicon that inherits morphosyntactic, syntactic, and lexico-semantic constraints from an applicationindependent set of structured base definitions. A lexicon generator expands the DATR lexicon out into a disjunctive normal form lexicon. This is then encoded either as an acceptance lexicon (in which the constraining features are bit-encoded for use in pruning word lattices), or as a full lexicon (which is used for assigning interpretations or for generating messages).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A generic lexicon tool for word model definition in multimodal applications

This paper describes a generic lexicon tool which uses lexical representations and finite state transducers enhanced by arithmetic operations in DATR to generate individual output formats from a general phonological feature based representation. The tool was developed in connection with the lexicon component of a diagnostic evaluation toolkit, BEETLE, for a linguistic word recognition system. T...

متن کامل

A Lexicalized Tree Ad- Joining Grammar for English. a Lexicalized Tree Adjoining Grammar for English. Automatic Acquisition of Datr Theories from Observations. Theories Des Lexicons: 6 Comparison with Related Work 5 Applying Lexical Rules

This paper shows how DATR, a widely used formal language for lexical knowledge representation, can be used to de ne an LTAG lexicon as an inheritance hierarchy with internal lexical rules. A bottom-up featural encoding is used for LTAG trees and this allows lexical rules to be implemented as covariation constraints within feature structures. Such an approach eliminates the considerable redundan...

متن کامل

ITRI-03-02 A large-scale inheritance-based morphological lexicon for Russian

In this paper we describe the mapping of Zaliznjak’s (1977) morphological classes into the lexical representation language DATR (Evans and Gazdar 1996). On the basis of the resulting DATR theory a set of fully inflected forms together with their associated morphosyntax can automatically be generated from the electronic version of Zaliznjak’s dictionary (Ilola and Mustajoki 1989). From this data...

متن کامل

A Large-scale Inheritance-based Morphological Lexicon for Russian

In this paper we describe the mapping of Zaliznjak’s (1977) morphological classes into the lexical representation language DATR (Evans and Gazdar 1996). On the basis of the resulting DATR theory a set of fully inflected forms together with their associated morphosyntax can automatically be generated from the electronic version of Zaliznjak’s dictionary (Ilola and Mustajoki 1989). From this data...

متن کامل

Some re ections on the conversion of theTIC lexicon into

The Traac Information Collator (TIC) 1 (Allport, 1988, 1989) is a prototype system which takes verbatim police reports of traac incidents, interprets them, builds a picture of what is happening on the roads and broadcasts appropriate messages automatically to motorists where necessary. Cahill and Evans (1990) described the process of converting the main TIC lexicon (a lexicon of around 1000 wor...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Linguistics

دوره 18  شماره 

صفحات  -

تاریخ انتشار 1992